Efficient Dimensionality Reduction for Canonical Correlation Analysis

Authors

  • Haim Avron
  • Christos Boutsidis
  • Sivan Toledo
  • Anastasios Zouzias

Abstract

We present a fast algorithm for approximate canonical correlation analysis (CCA). Given a pair of tall-and-thin matrices, the proposed algorithm first employs a randomized dimensionality reduction transform to reduce the size of the input matrices, and then applies any CCA algorithm to the new pair of matrices. The algorithm computes an approximate CCA to the original pair of matrices with provable guarantees while requiring asymptotically fewer operations than the state-of-the-art exact algorithms.
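The two-stage idea in the abstract (sketch both matrices with a randomized transform, then run any CCA routine on the smaller pair) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' method: the abstract does not name the transform, so a dense Gaussian sketch stands in for it, and the function names, the sketch size `sketch_rows`, and the synthetic data are hypothetical. A dense Gaussian sketch does not by itself yield the asymptotic savings the abstract claims; it only shows the structure of the approach. Both inputs are assumed to have full column rank.

```python
import numpy as np

def canonical_correlations(A, B):
    """Canonical correlations via QR + SVD (assumes full column rank)."""
    Qa, _ = np.linalg.qr(A)  # orthonormal basis for range(A)
    Qb, _ = np.linalg.qr(B)  # orthonormal basis for range(B)
    # Singular values of Qa^T Qb are the cosines of the principal angles.
    return np.linalg.svd(Qa.T @ Qb, compute_uv=False)

def sketched_canonical_correlations(A, B, sketch_rows, seed=0):
    """Approximate CCA: apply the SAME random sketch to A and B, then run CCA.

    Assumption: a Gaussian sketch is used purely for illustration.
    """
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    S = rng.standard_normal((sketch_rows, n)) / np.sqrt(sketch_rows)
    return canonical_correlations(S @ A, S @ B)

# Hypothetical tall-and-thin inputs sharing a 2-dimensional latent signal Z.
rng = np.random.default_rng(1)
n = 4000
Z = rng.standard_normal((n, 2))
A = np.hstack([Z + 0.5 * rng.standard_normal((n, 2)),
               rng.standard_normal((n, 3))])   # n x 5
B = np.hstack([Z + 0.5 * rng.standard_normal((n, 2)),
               rng.standard_normal((n, 2))])   # n x 4

exact = canonical_correlations(A, B)
approx = sketched_canonical_correlations(A, B, sketch_rows=1000)
print("exact :", np.round(exact, 3))
print("approx:", np.round(approx, 3))
```

The same sketching matrix must be applied to both inputs so that row correspondences between the two matrices are preserved; the second stage can be replaced by any other CCA solver.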

Similar resources

Cluster Canonical Correlation Analysis

In this paper we present cluster canonical correlation analysis (cluster-CCA) for joint dimensionality reduction of two sets of data points. Unlike the standard pairwise correspondence between the data points, in our problem each set is partitioned into multiple clusters or classes, where the class labels define correspondences between the sets. Cluster-CCA is able to learn discriminant low dim...

Methods of Canonical Analysis for Functional Data

We consider estimates for functional canonical correlations and canonical weight functions. Four computational methods for the estimation of functional canonical correlation and canonical weight functions are proposed and compared, including one which is a slight variation of the spline method proposed by Leurgans, Moyeed and Silverman (1993). We propose dimension reduction and dimension augmen...

Visualizing Web Images Using Fisher Discriminant Locality Preserving Canonical Correlation Analysis

A novel dimensionality reduction method, Fisher Discriminant Locality Preserving Canonical Correlation Analysis (FDLP-CCA), for visualizing Web images is presented in this paper. FDLP-CCA can integrate two modalities and discriminate target items in terms of their semantics by considering unique characteristics of the two modalities. In this paper, we focus on Web images with text uploaded on So...

Shared Gaussian Process Latent Variables Models

A fundamental task in machine learning is modeling the relationship between different observation spaces. Dimensionality reduction is the task of reducing the number of dimensions in a parameterization of a data set. In this thesis we are interested in the crossroads between these two tasks: shared dimensionality reduction. Shared dimensionality reduction aims to represent multiple observation spa...

A Latent Variable Model for Two-Dimensional Canonical Correlation Analysis and its Variational Inference

Describing dimension reduction (DR) techniques by means of probabilistic models has recently been given special attention. Probabilistic models, in addition to offering better interpretability of the DR methods, provide a framework for further extensions of such algorithms. One of the new approaches to probabilistic DR methods is to preserve the internal structure of data. It is meant that i...

Journal title:
  • SIAM J. Scientific Computing

Volume 36  Issue 

Pages  -

Publication date 2013